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Decorrelation of Signals 



This invention relates to decorrelation of signals; more particularly it relates to 
decorrelation of signals that have undergone convolutive mixing, and to a method, an 
apparatus and a computer program for implementing decorrelation. 

5 Decorrelation is known: it is a form of signal processing implemented by software running 
on a computer system and accepting data from sensors after conversion from analogue to 
digital form. Two assumptions are normally made in decorrelation, stationarity and 
linearity, and these assumptions are also made in connection with the present invention. 
Stationarity means that signals and channels in which they mix do not change over a time 
10 interval during which mixed signals are sampled. Linearity means that mixtures of signals 
received by sensors are linear combinations of these signals. More complicated 
combinations featuring signal products and squares and higher order powers of signals 
are not considered. 

The aim of decorrelation is to remove similarities between input signals, thereby 
15 eliminating redundancies. In some circumstances this may lead to recovering the signals 
as they were prior to mixing, but more usually it is used as a pre-processing step in an 
unmixing algorithm, or as part of another algorithm such as a data compression algorithm. 
A common usage is to separate the signals which carry meaningful data from those that 
arise due entirely to noise in the environment and in a receiver system which detects the 
20 signals: such separation may be an end in itself, or it may be used as part of a data 
compression scheme. 

Decorrelation is a process of removing correlations or similarities between signal pairs in a 
set of signals: correlation of a pair of signals is itself defined mathematically as an integral 
of the signal pair's product of over time. It is important that as little information as possible 
25 (preferably none) is destroyed in decorrelation process, as would occur for example if 
decorrelation led to one of the signals being zeroed. Use of energy preserving transforms 
for decorrelation avoids information being destroyed and leads to output signals with other 
useful properties. 

30 If an energy preserving transform is used, it is usually required to recover decorrelated 
signals in decreasing order of power. This allows data compression techniques and 
signal/noise subspace separation techniques to be implemented very simply. 
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Algorithms for decorrelating signals that have been instantaneously mixed are known:. they 
may use either transforms that are energy preserving or those that are not. 

Unfortunately, an algorithm which is adequate for decorrelating signals that have been 
5 instantaneously mixed cannot cope with more difficult problems. These more difficult 
problems occur when an output signal from a sensor must be expressed mathematically 
as a convolution, i.e. a combination of a series of replicas of a signal relatively delayed 
with respect to one another. It is therefore referred to as the "convolutive mixing" problem. 

The approach used in instantaneous algorithms has been extended to the convolutive 
mixing situation: from this approach it has been inferred that convolutively mixed signals 
can be decorrelated by the use of an energy preserving technique. This algorithm would 
accommodate time delays involved in mixing and decorrelation. The criterion to be 
imposed has however changed: iie. pointwise decorrelation (as defined later) is longer 
sufficient, instead a wider condition involving decorrelation with time delays should be 
imposed. This will be referred to as wide sense decorrelation, and imposing this property 
as imposing strong decorrelation. 

Instead of the unitary matrix employed in instantaneous algorithms, an algorithm for 
imposing strong decorrelation by an energy preserving filter employs a paraunitary matrix. 
As will be described later in more detail, a paraunitary matrix is one which gives the 
20 identity matrix when multiplied by its paraconjugate matrix - a polynomial equivalent of a 
Hermitian conjugate matrix. A possible approach for the strong decorrelation problem is 
therefore to search for a paraunitary matrix that maximises a measure of the wide sense 
decorrelation. 

In this case, it may be possible to improve on the mere return of signals in decreasing 
25 order of power. It may be possible to provide for return of signals in decreasing order of 
power at each frequency in a set of frequencies. Thus a first signal has more power than 
other signals at all frequencies and a second signal has more power than all other signals 
apart from the first at all frequencies etc. This property is called spectral majorisation. 
Spectral majorisation is useful if the algorithm is to be used for data compression or for 
30 separating signal and noise subspaces. 
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In "Theory of Optimal Orthonormal Filter Banks" ICASSP 1996, P.P.Vaidyanathan 
discloses a use for filtering techniques where both strong decorrelation and spectral 
majorisation are necessary, neither being sufficient without the other. 

The simplest prior art method of imposing strong decorrelation involves using a 
5 multichannel whitening lattice filter. See for example S. Haykin, "Adaptive Filter Theory, 
Prentice Hall, 1991, although there are may references for these techniques, based on 
slightly different algorithms designed with different convergence and stability properties in 
mind. These techniques were not designed to impose strong decorrelation directly: instead 
they aim to recover an innovations sequence of the mixed signals?/ The comppnents of 
10 the innovations sequence so produced are strongly decorrelated as a consequence of 
what they are. However these techniques are not energy preserving, and so are not 
suitable for use in many scenarios. 

In "Principal Component Filter Banks for Optimal Multiresolution Analysis?', IEEE 
transactions on Signal Processing Vol 43 August 1995, M.K. Tsatsanis and G.B. 

15 Giannakis discuss how to find an optimal energy preserving and spectral majorising filter. 
This reference uses a frequency domain technique to show how an optimal filter can be 
found if infinite information is held about the signals and no constraint is placed upon the 
order of the filter. However it does not show either how this can be transferred to a data 
dependent algorithm, or how the filter can be constrained to a sensible order without 

20 losing optimality. However its calculations provide absolute performance bounds, and 
formed the basis of several of algorithms discussed below. 

In u Multirate Systems and Filter Banks?, Prentice Hall: Signal Processing Series, 1993, 
P.P. Vaidyanathan discloses parameterisation of paraunitary matrices in a stage-by-stage 
decomposition of a paraunitary matrix in z' 1 : here z A is an operator implementing a 
25 delay. Vaidyanathan shows that a product matrix built up from a series of pairs of 

paraunitary matrix blocks is paraunitary: here, in a pair of blocks, one block represents a 

» 

delay and the other a 2 by 2 unitary matrix implementing a Givens rotation (see US Pat No 
4,727,503). Vaidyanathan also shows that a paraunitary matrix of degree N, defined later, 
is a product of N+1 rotations and N one-channel delay operators all implementing the 
30 same unit delay. 

In "Attainable Error Bounds in Multirate Adaptive Lossless FIR Filters?', ICASSP 1 995, P.A. 
Regalia and D Huang propose two different eigenvalue-based algorithms for strong 
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decorrelation. Their algorithms are recursive, limited to two input signals and based upon 
a fixed degree lossless filter with parameters disclosed by Vaidyanathan. It is also 
necessary to access data from various points internal to the Vaidyanathan decomposition 
for use in adjusting the parameters: this adds an extra layer of processing complexity. 

5 In "Rational Subspace Estimation using Adaptive Lossless Filters?' IEEE transactions on 
Signal Processing Vol 40 Oct1992 P.A. Regalia and P. Loubatton disclose a different 
algorithm for attempting to impose both strong decorrelation and spectral majorisation. 
This algorithm is applicable to more than two signals, but it is based upon a deflationary 
approach where an optimal single signal output is first calculated, and then augmented 
10 with other outputs until one of the latter becomes insignificant. Again a fixed degree and 
fixed order lossless filter is parameterised as disclosed by Vaidyanathan, and the 
parameters are adjusted according to a stochastic algorithm, which requires information 
on the data flow inside the Vaidyanathan decomposition. 

A different algorithm is proposed in "Design of Signal-Adapted FIR Paraunitary Filter 
15 Bank? ICASSP 1996 by P. Moulin, M. Anitescu, K.O. Kortanek and F. Potra. This uses a 
slightly different version of the Vaidyanathan parameterisation with a fixed degree. From 
the effects of a parameterised filter on one single output channel, this algorithm shows 
how to convert a problem of maximising power in one channel into a linear semi-infinite 
programming problem, which has known methods for solution. In effect the algorithm has 
20 an objective of achieving both strong decorrelation and spectral majorisation; it has a cost 
function which is very similar to a known N* cost function to be described in more detail 
later. However it forces the paraunitary filter to be of a fixed degree, and relies upon a link 
between parameters and cost function being computationally tractable. For different cost 
functions this would not be the case. 

25 Gradient descent methods are known which aim to adjust all parameters of a paraunitary 
matrix or an unmixing filter simultaneously: these methods have another difficulty, i.e. they 
have parameters linked to any useful measure of independence in a very complex way 
which does not factorise easily. This means adjusting all parameters at once: it leads to 
very slow algorithm convergence, which is why none of the algorithms previously 

30 mentioned use gradient descent. Gradient descent tends to be avoided in the prior art due 
to problems with speed and accuracy of convergence. 
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It is an object of the invention to provide an alternative method of strongly decorrelating 
signals. 

The present invention provides a method for strong decorrelation of signals which are 
input, characterised in that it includes the steps of :- 

5 a) processing the input signals to determine delay and rotation parameters which 
implement at least one elementary paraunitary matrix and transform the input 
signals into output signals to obtain improvement in a measure of strong 
decorrelation; 

b) assessing the improvement in the measure of strong decorrelation, and if it is 
10 significant designating the output signals as input signals and iterating step a) and 

this step b), but if it is not significant moving to step c); 

c) designating the output signals as signals decorrelated in a wide sense. 

The invention provides a technique for wide sense decorrelation of signals which have 
been convolutively mixed, even if mixed using unknown filters. It is suitable, for example, 
for signals which have undergone multipath reflection, or acoustic signals in a reverberant 
environment. It is believed that the method of the invention is more reliable and requires 
less calculation than any alternative currently known. By virtue of use of an elementary 
paraunitary matrix, decorrelation is carried out using an energy preserving transform, and 
examples of the invention may (not necessarily but if required) impose spectral 
majorisation on the signals as well as imposing wide sense decorrelation. 

To transform the input signals, the method of the invention may determine in step a) delay 
and rotation parameters which characterise a single elementary paraunitary matrix. It may 
include producing a paraunitary matrix by cumulatively multiplying successive elementary 
paraunitary matrices produced by iterating step a). 

25 The range of signal delay parameters may be a set of discrete delay vectors, and the 
delay and rotation parameters may be determined by generating a respective version of 
the input signals delayed by each delay vector in the set, and for each version finding 
rotation parameters which at least approach producing maximisation of the output signals' 
strong decorrelation. The rotation parameters which at least approach producing 

30 maximisation of the output signals' strong decorrelation may be determined using an 
algorithm for instantaneous decorrelation. 
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The method may involve n input signals where n is an integer greater than 2, the range of 
signal delay parameters being a set of n-element delay vectors and the range of signal 
rotation parameters being a set of n(n - 1 )/2 angle parameters. Step a) may comprise 
determining delay and rotation parameters which implement at least one elementary 
5 paraunitary matrix providing for rotation of a pair of input signals and relative delay of the 
or as the case may be each other input signal. The n input signals may be associated with 
respective channels, step a) having n(n- 1)/2 successive stages each associated with at 
least one respective elementary paraunitary matrix and each providing for rotation of 
signals associated with a respective pair of channels and provision of relative delay 
10 associated with the or as the case may be each other channel, the first stage being 
arranged to process the input signals and the or as the case may be each subsequent 
stage being arranged to receive signals processed in the respective preceding stage. 



The method of the invention may involve a set of n input signals where n is an integer 
greater than 2, characterised in that it comprises: 
15 a) producing n(n - 1 )/2 replicas of the set of input signals, 

b) in each replica selecting a respective signal pair differing to that selected in other 
replicas, and 

c) the step of processing the input signals to determine delay and rotation parameters 
being carried out for each replica and comprising 

20 i) determining delay and rotation parameters which implement at least one 

elementary paraunitary matrix providing for rotation of the respective selected 
signal pair only, and 

ii) determining which replica when transformed by the associated at least one 
elementary paraunitary matrix gives rise to transformed signals corresponding 
25 to improvement in a measure of strong decorrelation by at least a major part of 

a maximum extent obtainable over the replicas and designating these 
transformed signals as output signals. 



In another aspect, the present invention provides computer apparatus for strong 
decorrelation of signals, the apparatus being programmed for reception of input signals, 
30 characterised in that the apparatus is also programmed to:- 

a) process the input signals to determine delay and rotation parameters which implement 
at least one elementary paraunitary matrix and transform the input signals into output 
signals to obtain improvement in a measure of strong decorrelation; 



b) assess the improvement in the measure of strong decorrelation, and if it is significant 
designate the output signals as input signals and iterating a) and b), but if it is not 
significant move to c); 

c) designate the output signals as signals decorrelated in a wide sense. 

In a further aspect, the present invention provides a computer programme for 
implementing strong decorrelation of signals input to computer apparatus, characterised in 
that it has instructions for controlling computer apparatus to:- 

d) process the input signals to determine delay and rotation parameters which implement 
at least one elementary paraunitary matrix and transform the input signals into output 
signals to obtain improvement in a measure of strong decorrelation; 

e) assess the improvement in the measure of strong decorrelation, and if it is significant 
designate the output signals as input signals and iterating a) and b), but if it is not 
significant move to c); 

f) designate the output signals as signals decorrelated in a wide sense. 

The apparatus and computer programme aspects of the invention may include features 
equivalent to those mentioned above in connection with the method of the invention. 

In order that the invention might be more fully understood, an embodiment thereof will now 
be described, by way of example only, with reference to the accompanying drawings, in 
which:- 

Rgure 1 illustrates decomposition of a paraunitary matrix into degree-one paraunitary 
matrices in a prior art convolutive unmixing process; 

Figure 2 is a schematic block diagram of decomposition of a paraunitary matrix into 
elementary paraunitary matrices in a convolutive wide sense decorrelation 
process of the invention; 

Figure 3 is a schematic block diagram illustrating production of a paraunitary matrix from 
elementary paraunitary matrices in a process of the invention; 

Figure 4 illustrates production of elementary paraunitary matrices in a process of the 
invention; 



Figure 5 



illustrates decomposition of a paraunitary matrix into elementary paraunitary 
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matrices in a prior art wide sense decorrelation process involving more than two 
signal channels; and 

% Figures 6 and 7 schematically illustrate alternative wide sense decorrelation processes of 
the invention involving more than two signal channels. 

The present invention is arrived at by modifying and extending the prior art of both 
instantaneously decorrelating signals and of strongly decorrelating signals: this prior art 
will first be discussed in more detail to enable the invention to be better understood. 

Wide sense decorrelation is based entirely upon second order properties of data. The 
equality at (1) below holds if and only if the instantaneous decorrelation property holds for 
the pair of signals x p x 2 . 

^(x 1 x 2 ) = £(x I )£(x 2 ) (1) 

Here E is the expectation operator, and a product x^ 2 represents the signal obtained by 
multiplying each sample of a signal x t by a respective sample of a signal x 2 having the 
same sample time: this defines pointwise decorrelation of a pair of signals. 

Expectations can be estimated from sampled signals to determine whether or not 
Equation (1) holds for sampled signals. If Equation (1) holds for all signal pairs then the 
signals are referred to as being decorrelated. 

Mathematically, instantaneous mixing of two signals from sources (e.g. loudspeakers) and 
their receipt by two receivers can be represented as follows:- 

x 2 (t) = h 21 s 1 (t) + h n s 2 (t) ( ' 

Here loudspeaker outputs (original signals) reaching the two receivers at time (t) are 
represented by ^(Oand s 2 (t) : t is in units of an interval between successive digital signal 
samples (clock cycles), and has integer values 1, 2 etc. The path from each loudspeaker 
to each receiver and the receiver's reception characteristics give rise to a constant 
channel modulation. In the instantaneous mixing problem, this modulation is represented 
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by multiplier terms , where i = 1 or 2 indicates first or second receiver and j = 1 or 2 

indicates first or second loudspeaker respectively. First and second loudspeaker signals at 
time (t) are represented by x x (t)and x 2 (t). Each receiver signal is the sum of two 

loudspeaker signals, each multiplied by a respective coefficient. These equations can be 
5 represented more compactly in matrix-vector format as:- 

x(0 = Hs(0 (3) 

where s is a 2 by 1 element vector of two loudspeaker signals, H is a 2 by 2 element 
mixing matrix and x is a 2 by 1 element vector of two received signals, one at each 
receiver. H is referred to as the mixing matrix because it has matrix elements 
10 (coefficients) that operate on the loudspeaker signals and mix them to form the received 
signals. The sizes of s , x , and H will increase if more than two signals or more than two 
sensors are involved. As previously mentioned, algorithms for solving the instantaneous 
decorrelation problem produced by Equation (3) exist in the prior art. 

Mathematically, a correlation matrix R of a set of signals expressed as a vector x can be 
15 defined as> 

R = E((x-E(x))(x-E(x)) H ) (4) 

where E(x) is the expectation of x, that is the mean of x over time and the superscript H 
means Hermitian conjugate. In Equation (4) E(x) is subtracted from x this is the first step 
in all processing that follows. However, to simplify notation it is usually assumed that 
20 signals have zero mean values, and this assumption will be used in what follows. "This 
means that Equation (4) simplifies to R = jE(xx h ). However, it is not necessary for 
signals to have zero mean values in order for the invention to be effective. 

To impose decorrelation upon mixed signals, it is necessary to force the correlation matrix 
R to have off-diagonal coefficients which are equal to zero. This is the instantaneous 
25 decorrelation problem. It can be done, for example, by ensuring that a correct amount of 
received signal 1 is subtracted from received signal 2 to ensure that the resulting output is 
decorrelated from received signal 1 . This is repeated for all other signals, producing a set 
of outputs decorrelated with received signal 1 , which is then defined as output 1 . The 
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remaining set of outputs is then decorrelated with each other by repeating this procedure 
using them. This is equivalent to obtaining decorrelated signals x' by pre-multiplying 
signals x by a lower triangular matrix L, i.e. a matrix having all above diagonal 
coefficients equal to zero. Instantaneously decorrelated signals are designated x' and 
have correlation matrix R' defined by:- 

R' = E(x'x' H ) = #(Lxx*L* ) = D (5) 

Here the new correlation matrix is shown as D, which is diagonal and positive semi- 
definite. 

In order that instantaneous decorreiation algorithms be capable of imposing second order 
independence, a relative propagation delay between a time sample of a signal from a 
signal source reaching two separate sensors must representable as a simple phase shift 
applied to a signal received by one of the sensors.., For N sensors this becomes N - 1 
phase shifts. If this condition does not hold, the channels from the signal sources to the 
sensors may be convolutive and in consequence instantaneous decorreiation algorithms 
will fail to impose second order independence. 

One-channel convolution and its analysis using z-transforms will now be considered. In a 
single signal, single receiver arrangement, the loudspeaker signal consists of a series of 
values indexed by the respective times t at which they were obtained and denoted by s(t) . 
The loudspeaker signal passes through a channel and is received by the receiver that 
detects another series of values, x(t) . The link between s(t) and x(t) may not be a 
simple multiplication as in instantaneous mixing. Instead x(t) may consist of a linear sum 
of several of earlier values of s(t) . This is known as a convolution, and is shown by:- 

x(t) = h® s(t) = £ h(k)s(t - k) (6) 

where ® is referred to as a "convolution operator". Generation of a linear sum of values of 
s(0may occur because signals pass through an irregular channel which spreads them 
out. It may also occur from multi-path effects, i.e. when there is more than one path a 
signal can take to reach a receiver and the paths have differing propagation times. 
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Instantaneous mixing is a simple case of convolutive mixing. 

Convolution is more complex than and involves more algebra than simple multiplication by 
a coefficient. However it is possible to reduce it to a multiplication by the use of z- 
transforms as follows. The z-transforms of s,x and h are:- 

s(z) = s(0) + ^l)*" 1 + s{2)z~ 2 + s(3)z~ 3 + — 
5 x(z) = jc(0) + x{l)z" 1 + x(2)z" 2 + Jt(3)z~ 3 + - • - (7) 

h( z ) = h(0) + h(l)z^ + h(2)z" 2 + • • • + 

Here z" 1 is referred to as a delay operator as mentioned above. The z-transform of a 
signal expressed as a time series of sample values is a polynomial in z~ l . Given the form 
of the z-transform, its polynomial coefficients enable recovery of the original signal. When 
in their z-transform forms, the convolution of s by h as shown in Equation (6) becomes a 
10 multiplication of two polynomials. Thus:- 

x(z) = h(z)s(z) (8) 

Convolutive mixing may be addressed by combining instantaneous mixing as per Equation 
(1) with the more complicated approach of Equations (6) to (8). This deals with a plurality 
of sources and a plurality of receivers with intervening channels which are convolutive. 
15 Thus for the two-signal, two-receiver case the following expressions for received signals 
can be written:- 

JCj(0 = *u®Ji(0 + *ia®*a(0 (g) 
x 2 (t)=h 2l ®s 1 (t) + h 22 ®s 2 (t) 

In the vector-matrix notation of Equation (3), and using the z-transform notation of 
Equations (7) and (8), Equation (9) can be written simply as:- 

20 x(z) = H(Os(s) 00) 

Here s(z) is a 2 by 1 coefficient vector formed from z-transforms of loudspeaker outputs. 
H(z) is a 2 by 2 coefficient polynomial mixing matrix and x(z) is the 2 by 1 vector formed 
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from z-transforms of received signals. Again if more than two loudspeakers and/or 
receivers are present, the sizes of s(z), x(z) and H(z) increase accordingly. 

H(z) is called a polynomial mixing matrix because it represents mixing of signals: its 

coefficients are polynomials in z" 1 . It may also be considered to be a polynomial in z" 1 
5 with coefficients which are matrices. It is found one coefficient at a time by taking z- 
transforms of individual loudspeaker/receiver channels. For future convenience, the order 
and degree of a polynomial matrix will now be defined. The order of a polynomial matrix is 
the greatest power to which z~ l is raised in the matrix. The degree of a polynomial matrix 
is the smallest number of delays necessary to implement it as a filter. It is always at least 
10 the same size as the order but may be greater. 

The instantaneous decorrelation problem only deals with removing correlation between the 
signals with no time delays applied to them. In order to impose wide sense decorrelation 
the measure of decorrelation has to operate across time delays. That is, to be strongly 
decorrelated, the correlation between two signals has to be zero when any time delay is 
15 applied to one of the signals. This can be written mathematically as:- 



where Vde ( 2,-1,0,1,2,- 0 means all values of delay d (in units of a time interval 

between successive signal samples) in the set of all positive and negative whole numbers, 
and D(<i) is a diagonal matrix (all off-diagonal coefficients zero) for all values of d . It is 
20 possible to put Equation (1 1 ) into z-transform form as follows:- 



where D(z) is a diagonal polynomial matrix. Equation (12) introduces the concept of a 
paraconjugate x(z)of a polynomial matrix x(z) denoted by a tilde above the matrix 
symbol x . It means combined operations of transposition and conjugation (as in the 
25 Hermitian operator) and the substitution of 1/z* for z . Paraconjugation is an extension of 
the Hermitian operator to polynomial matrices. 



R(d) = E(x(tMt-d H 



)) = D(d) W e 0 • ■ - 2 ,-1,0,1,2,'. ■ 0 (11) 



R(z) = *(x(*)x(z)) = D(z) 



(12) 



To achieve wide sense decorrelation it is necessary to force all off-diagonal terms of a 
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polynomial correlation matrix to become the zero polynomial, i.e. the polynomial in z 
having all its coefficients equal to zero. No constraints are put upon the polynomial 
functions of the matrix on its diagonal. Thus:- 



The unit polynomial is the polynomial that has zero coefficients for all powers of z 
except for z° which has unity as its coefficient. In the prior art a multichannel lattice filter, 
see e.g. S. Haykin, "Adaptive Filter Theory, Prentice Hall, 1991, imposes strong 
decorrelation while ensuring that the diagonal terms of the polynomiarcorrelation matrix 
10 are the same by forcing them all to be the unit polynomial. This whitens the signals. 

The polynomial correlation matrix of the signals x(z) is given by Equation (12) above. A 
multichannel lattice filter transforms these to obtain a matrix of strongly decorrelated and 
whitened signals x\z) . These signals have the property that their correlation matrix R'(z) 

is the identity polynomial matrix I, symbolically:- 



The fact that this method imposes the whitening condition upon the signals means that it 
carries out more calculation than is necessary and has not necessarily used an energy 
preserving transform. It can also lead to difficulties when original signals driving the mixing 
were not white, or when mixing nulls some of their frequency components, as it whitens 
20 the outputs and thus amplifies the noise in those frequencies that were nulled. 

Instantaneous algorithms that use energy preserving transforms achieve decorrelation by 
the use of unitary matrices. These are energy preserving, and are defined as those 
matrices that when multiplied by their Hermitian conjugate give the identity. The extension 
of the Hermitian operator to the space of polynomial matrices is the operation of 
25 paraconjugation. This leads to the definition of paraunitary matrices as those polynomial 
matrices that give the identity matrix when multiplied by their paraconjugate. Symbolically 

H(z) is a paraunitary matrix if and only if H(z)H(z) = H(z)H(z) = I . 



cr x (z) 0 
0 <r 2 (z) 



(13) 



R'(*) = *(*(z)r<z)) = I 



(14) 
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Paraunitary matrices are energy preserving and, more strongly, are energy preserving at 
all frequencies. This means that if decorrelation is achieved by the use of paraunitary 
matrices, the problems of whitening mentioned above are avoided. 

As previously mentioned, the Vaidyanathan reference discloses parameterisation of the 
5 space of all paraunitary matrices. This gives a stage-by-stage decomposition of a 
paraunitary matrix in z' 1 . The parameterisation is illustrated in Figure 1 for a two-channel 
case. It comprises a series of IM+1 rotation blocks Q 0 , Qi, ... Q N , adjacent pairs of these 
blocks being separated by individual delay blocks A(z) which are all alike. Upper and lower 
signal channels 20 and 22 pass through all the blocks. Amplifiers 24 indicate channel 
10 input signal scaling factors a, of which the upper channel factor is positive and the lower 
channel factor is positive or negative as indicated by "+-" before "a". 

Each of the rotation blocks Q 0 to Q N implements a respective 2 by 2 unitary matrix which 
itself implements a Givens rotation (see US Pat No 4,727,503): a Givens rotation is a 
rotation parameterised by a single rotation angle 0 . In the first rotation block Q 0 , a signal 

15 in the lower channel 22 is multiplied by Givens rotation parameters s 0 and c 0 ; a signal in 
the upper channel 20 is multiplied by Givens rotation parameters -s 0 and Co ; s 0 and c 0 are 
respectively the sine and cosine of the rotation angle 6 implemented by the block Q 0 . 
Each c 0 product is summed with the s 0 or -s 0 product from the other channel to provide a 
sum which passes along the respective channel 20 or 22 to the next block A(z). This next 

20 block A(z) delays the signal in the lower channel 22 with respect to that in the upper 
channel 20. This procedure is then repeated in subsequent rotation block Q 1 to Q N , with 
equal intervening delays A(z) in the lower channel 22. 

The rotation blocks Q 0 to Q N and delays A(z) are themselves paraunitary, and so their 
product is a matrix which is also paraunitary. Vaidyanathan proved that any finite degree N 
25 paraunitary matrix can be expressed in this form. Thus apart from a scalar factor, any 
finite degree paraunitary matrix can be expressed as follows:- 

H„ ( Z ) = Q„ • • • A(z)Q l A(z)Q 0 (1 5) 



where N is the degree of the paraunitary matrix. Thus a paraunitary matrix H N (z) of 
degree N is the product of N+l Givens rotations and N one-channel delay operators of 
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equal delays, as shown by the concatenated stages in Figure 1 . 



However, there is a difficulty with using Vaidyanathan parameterisation, a simple example 
of which is given here: consider signals which form an independent identically distributed 
(iid) time series. Assume these signals have been mixed by a paraunitary mixing matrix. 
Let the paraunitary mixing matrix have degree 1 and be formed, using Vaidyanathan 
parameterisation, as:- 



H„(z) = Q x A(z)Q 0 



= "1 OTl 0 Tcos(0 o ) sin(0 o )* 
" 0 ijo- z-'i-sin^o) cos(0 o ) 



(16) 



i.e. the last of the two Givens rotations (within the left hand pair of square brackets) is in 
fact the identity. It has been discovered in accordance with the invention that attempts to 

10 impose strong decorrelation on signals mixed by H„(z) should begin by undoing the 
delay, not by applying a rotation. However, in applying a single block type approach, the 
first step is to try to find an appropriate rotation to apply, even though one is not needed. 
This superfluous rotation is very difficult for later blocks to undo, and its application can 
introduce more correlation between the signals, which is precisely the opposite of the 

15 required decorrelation. 

The present invention is intended to overcome the difficulties indicated above, and to 
avoid superfluous calculations. It has similarities with Vaidyanathan's parameterisation but 
unlike the latter it employs variable delays not necessarily confined to one channel and not 
necessarily equal; moreover it does not force a rotation to be applied when there is little or 
20 nothing to gain, and allows arbitrarily sized paraunitary matrices to be constructed. 

Figure 2 illustrates the construction of a paraunitary matrix H„(z) as a product of N+l 
elementary paraunitary matrices in accordance with the invention. An elementary 
paraunitary matrix is an expression coined for the purposes of the present invention: it is 
defined as a polynomial matrix which represents applying a set of delays to the signals, 
25 followed by a single unitary transformation (rotation). In Figure 2 and as in Figure 1 , for the 
sake of simplicity, only the two-channel case is shown as indicated by upper and lower 
channels 30 and 31 . Each of a set of dotted line blocks Vj(z) (i = 0, 1 , to N) represents a 
respective single elementary paraunitary matrix that implements a respective delay dj 
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followed by a Givens rotation. N+l of the blocks V } (z) are used to make up the paraunitary 
matrix (z) . 

Each of the blocks V 0 (z) to V N (z) implements a respective 2 by 2 elementary paraunitary 
matrix which itself implements a Givens rotation. In the first block V 0 (z), signals in the 
5 upper and lower channels 30 and 31 pass into a matrix of four ganged single pole double 
throw switches 32 controlling routing or otherwise via a delay cell Z?°. The switches 32 are 
all either in the UP position or in the DOWN position: when DOWN, upper channel signals 
are delayed at Z d0 but lower channel signals are not, and when UP the reverse is the case. 
This enables either channel 30 or 31 to be delayed relative to the other. 

10 On leaving the switches 32, an upper channel signal is multiplied by Givens rotation 
parameters -s 0 and c 0 at amplifiers 33 and 34 respectively. Similarly, on leaving the 
switches 32, a lower channel signal is multiplied by Givens rotation parameters s 0 and c 0 
at amplifiers 35 and 36 respectively. Here s 0 and c 0 are respectively the sine and cosine of 
a rotation angie 0 0 implemented by the block V 0 (z). Each c 0 product is summed with the s 0 

15 or -s 0 product involving the signal from the other channel: this provides sums, which pass 
along respective channels 30 and 31 to the next block V^z). This deiay/rotate procedure is 
then repeated in subsequent blocks V^zjto V N (z). These later blocks will not be described 
further because they operate in the same way as the first block V 0 (z), except their delays 
and rotations will usually be different. Implementation of this procedure to find appropriate 

20 delays and rotations will be described later, following a theoretical discussion. 

A full paraunitary matrix parameterisation in accordance with the invention in terms of 
elementary paraunitary matrices can be expressed as follows:- 

h„(z) = (z)-v diA wv w w (17) 

The progress of imposing wide sense decorrelation upon signals in accordance with the 
25 invention uses a cost function that is calculated after every application of an elementary 
paraunitary matrix to the signals to determine whether or not an improvement has been 
achieved. The cost function is a measure of the signals 1 strong decorrelation, an example 
of which is a sum squared of second order measures of autocorrelation of the signals, and 
is at a maximum when all cross-correlation terms are at a minimum. Thus an increase in 
30 this cost function corresponds to a reduction in correlation between the signals. The cost 
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function will be referred to as the Decorrelation Cost Function (hereinafter "DCF"). 

The process of the invention is to find an appropriate paraunitary matrix that imposes wide 
sense decorrelation on the signals. This matrix is not parameterised as disclosed by 
Vaidyanathan, but it is instead parameterised in terms of elementary paraunitary matrices 
5 (see Equation (17)) calculated sequentially. Once derived, each elementary paraunitary 
matrix is applied to the signals used to calculate it: this transforms the signals into a form 
suitable for calculating the next such matrix. 

To calculate an elementary paraunitary matrix a set of possible delays is considered. A 
latest elementary paraunitary matrix is chosen which maximises the DCF calculated for 

10 signals transformed by application of all such matrices calculated up and including the 
latest. To identify the latest elementary paraunitary matrix its delay and rotation 
parameters are required. There is a countable infinity of different possible delays, but an 
uncountable number of different rotation parameters. Realistically the countable infinity of 
possible delays will be reduced to a finite number of different delays to be considered, 

15 either by constraining them or by making the assumption of circulant statistics and hence 
constraining them to be less than the data length (number of samples of each signal used 
in decorrelation). 

Once a trial value of delay has been applied to the signals, rotation parameters s 0 and c 0 
are chosen. These parameters could be chosen by using a technique which explicitly 
20 maximises the DCF of output signals. 

It is therefore possible to look at a range of trial delays and for each delay to attach the 
rotation that maximised the DCF after the application of the delay and the rotation. This 
produces a set of possible elementary paraunitary matrices, each associated with a 
respective DCF found by applying it to signals and finding the DCF of the resulting output. 
25 Under certain assumptions it may be possible to avoid applying the elementary 
paraunitary matrices to the signals, and instead to calculate the DCF of the resulting 
output directly. This has computational advantages, but is only possible in some cases. 

The elementary paraunitary matrix that is selected is that having the best DCF output. This 
stage is repeated until there is no gain in DCF to be found, or the gain in DCF is regarded 
30 as being too small to warrant the extra complexity in the strongly decorrelating matrix. 
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Thus the invention can be seen as a method of imposing wide sense decorrelation by 
sequentially applying a series of elementary paraunitary matrices each chosen to 
maximise an DCF measure in an output resulting from its use. Each elementary 
paraunitary matrix contains one rotation. Thus the process of the invention implements an 
algorithm referred to as the Sequential Best Rotation (SBR) algorithm. This algorithm 
comprises decomposition of the paraunitary matrix into elementary paraunitary matrices, 
and the use of a measure which can be meaningfully calculated at each stage of the 
algorithm not just at the end. The combination of these two steps separates the algorithm 
into a series of like stages that can be carried out in sequence. 

Strong decorrelation of signals is equivalent to a process of generating a strongly 
decorrelating matrix H(^). This process will now be described. It is not essential to 
generate H(z) , but it permits the process to be tested: 1 i.e. mixed signals which have 
undergone a prearranged form of convolutive mixing can be used to produce H(e) , and 
then H(z) , the paraconjugate of H(z) , can be compared with the prearranged form to 
see how faithfully the former reconstructs the latter. Moreover, it can be shown that under 
certain circumstances H(z) can be used to determine further information about original 
signals, such as arrival direction and frequency responses. 

Referring now to Figure 3, a flow diagram is shown illustrating an iterative process of the 
invention to produce a strongly decorrelating polynomial paraunitary matrix H(z) as the 
product of a series of elementary paraunitary matrices. At the beginning of each 
successive iteration of the process there is a respective current value h(z) of the evolving 
paraunitary matrix, i.e. the product of all elementary paraunitary matrices calculated up to 
the present time. There is also a set of current signals derived using the most recently 
calculated elementary paraunitary matrix incorporated in h(z): h(z) represents the 
transformation which converts the original input mixed signals to obtain the current 
signals: it is improved repeatedly until applying a further elementary paraunitary matrix 
results in no significant improvement in DCF, at which point h(z) has become H(z) . 

The process starts at 60, where the initial current signals are the original input signals, and 
the initial current value of the paraunitary matrix h(z) is the identity matrix I . At 61 the 
current signals are used to produce a current elementary paraunitary matrix; at 62 this 
matrix is applied to the current signals to produce new signals. At 63 the new signals are 
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tested to see whether their strong decorrelation properties have improved: if such an 
improvement has been obtained, at 64 the current elementary paraunitary matrix is used 
to pre-multiply h(z) to provide the latest current value of the evolving paraunitary matrix 
and the new signals become the latest current signals. The latest current value of h(z) 
5 and the latest current signals are then fed back to stage 61 for the procedure of stages 
61 , 62 and 63 to be repeated. 

If at 63 the new signals show no improvement over the current signals, the process is 
considered to have converged to a solution and it terminates at 65: the current value of 
h(z) is output as TH(z) and the current signals are output as the required strongly 
10 decorrelated signals: Thus the use of elementary paraunitary matrices divides the problem 
of strongly decorrelating signals and obtaining H(z) into a number of sub-problems. 

Methods for finding each elementary paraunitary matrix at 61 and for deciding at 63 
whether or not the new signals are improvements over the current signals will now be 
described. The second relies upon a measure of strong decorrelation of signals being 
15 used. An accurate and comprehensive decorrelation cost function (DCF) is impossible to 
calculate from sampled signals, but it is possible to calculate various different measures of 
strong decorrelation. Some examples of good decorrelation cost functions will now be 
given. They are based upon the correlation terms r(x x ,x 2 ) of a set of signals defined by: 



20 where E is the expectation operator and x 1 and x 2 are signals in the set: they may be 
different signals, or the same signals, and they may have delays applied to them. 

Decorrelation cost functions, denoted by Nf , are defined as the sum squared of all 

possible correlation terms in a set of signals. In the present example correlations are 
derived between each signal in the set and a relatively delayed replica of itself, i.e. there 
25 are no correlations between different signals: x t and x 2 in Equation (18) are therefore the 
same signal with a relative delay therebetween, and correlations are obtained for each of 
a set of delays up to a maximum delay of L. The maximum delay L can be regarded as an 
indication of the locality of the cost function. If L = 0 the cost function is referred to as a 
. pointwise cost function, if L is small, i.e. L<5, the cost function is referred to as a local cost 



r(%!,* 2 ) =E[x 1 x 2 ] 



(18) 
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function, otherwise it is a wide cost function. 

Changing the L value changes the properties of Nf, but for ail values of L this is a 
sensible decorrelation cost function. In particular, the two cases of L=0 and L=max (i.e. 
the largest value that can be measured using the sampled signals) have been successfully 
5 used. The L=0 DCF leads to an algorithm which imposes spectral majorisation, while the 
L=max DCF leads to an algorithm that only imposes wide sense decorrelation, and not 
spectral majorisation. The process of the invention may use any convenient DCF, 
provided it is an accurate DCF and is used consistently throughout: specific problems may 
suggest specific DCFs. 

10 At 63 current signals and new signals are compared by calculating their DCFs: the signals 
having the higher DCF are accepted as being the better of the two. 

To obtain the latest elementary paraunitary matrix at 61, it is necessary to derive two 
parameters which characterise such a matrix, a delay parameter d and a rotation 
parameter 9 . Thus identification of the elementary paraunitary matrix reduces to finding 
15 (d,0) . The parameter d is a vector parameter, and 6 can be a vector parameter if there 
are more than two signals. However in this example only the case of two signals is being 
considered, thus 8 is not a vector. 

To find the delay parameter d , a set D of possible values for it is selected. A suitable set 
for the two-signal case is shown below in Equation (22), i.e. D may be represented as: 

nWW W(o)(o) [ o }[ojj 

20 

where dots indicate terms not shown, each inner pair of parentheses represents a delay 
vector 5, having two elements, 0 in an upper or lower position in parentheses indicates an 

undelayed upper or lower signal channel respectively, and a non-zero value in such a 
position indicates a delayed channel located likewise: each non-zero value represents 
25 delaying the associated channel in each case relative to the other channel by an integer 
value not greater than A Delays are in units of a time interval'? between successive signal 
samples, e.g. a delay (A - 1) implies (A - 1)r. There are also other possibilities for D , but 
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before major departures from this scheme are implemented they should be assessed for 
validity. The number of elements or delay values in D is S , i.e. |£>| = S . 

Referring now to Figure 4, there is shown a flow diagram indicating how elementary 
paraunitary matrices can be found. For all delay vectors b x \o b s in D, respective 

5 elementary paraunitary matrices are derived, and then the DCFs of the signals these 
matrices create are calculated. The elementary paraunitary matrix corresponding to the 
best DCF is then selected for use. In the drawing, in the two-channel case a pair of current 
signals are input at 70. At 71 the signals are replicated S times, i.e. as many times as 
there are delay values in£>. In the two channel case each replica is a pair of current 

10 signals: each replica pair is input to a respective signal channel 72j of which there are S , 
i.e. i = 1 to S . Three channels are shown, 72^, 72 2 and 72 s , others being implied by chain 
lines such as 73. At 74i to 74 s , each of the channels 72, to 72 s applies a respective 
delay vector 5 X , ....5 5 from D to its replica, i.e. it delays one of the pair of current signals 
in its replica relative to the other: the ith channel 72\ applies delay vector 5 f to the ith 

15 replica, and i = 1 to S . In each of the channels 72^ to 72 s the relative delay produces a 
pair of signals that are different from those input at 70, although they are no more strongly 
decorrelated than before. 

To make these signals as decorrelated as possible in a pointwise sense using only a 
rotation can be achieved by using an instantaneous algorithm. A review of many methods 
20 of imposing instantaneous decorrelation, both with and without using energy preserving 
techniques, appears in "Matrix Computations", John Hopkins University Press 1989 by 
G.H. Golub and C.F. Van Loan. This covers a wide range of techniques developed over 
the last 150 years, and discusses numerical issues caused by the introduction of 
computers into the field. 

25 . 

Suitable rotations may be calculated by one of several different techniques. For many 
choices of DCF it may be possible to calculate the rotation that explicitly maximises the 
DFC of the signals afer the application of the rotation. It may also be possible to try to find 
this rotation by a numerical approximation method. A third possibility is to use a pointwise 
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decorrelation algorithm to find the roation that imposes pointwise decorrelation; this should 
lead to an improvement in DCF to at least a predominant part of the maximum 
improvement attainable In this regard. These three rotation finding possibilities may 
coincide, as they do for the N{ measure with L=0. 

5 Each channel 72j now has parameters associated with it consisting of a delay vector 8 f 

and a rotation angle e u just as if an elementary paraunitary matrix implementing those 
parameters had been applied to the signals input at 70: moreover, when applied to input 
signals this matrix provides a rotation which produces output signals that are as 
decorrelated as they can be for the relevant value of delay element 8, . The first two 
10 elements 74\ and 75j of each channel 72 { therefore in combination simulate an elementary 
paraunitary matrix with a respective parameter set (d, 0) = (8 f 

The next stage is to determine which of the channels 72 t to 12 s simulates an elementary 
paraunitary matrix producing the highest improvement in strong decorrelation. This is 
carried out at 76 t to 76 s , where DCFs of signals output from stages 75) to 75^ 

15 respectively are calculated. These DCFs are compared with one another at 77, and 
whichever channel produces the greatest DCF provides the highest improvement in strong 
decorrelation and simulates the best elementary paraunitary matrix: this matrix is selected 
as the output of the process indicated in Figure 4 for use at 61 in Figure 3. 

On making certain assumptions regarding the input signals it is possible to avoid most of 
20 the calculations in this stage. It may be possible to predict the DCFs of signal outputs 
from stages 75^ to 75 s . If this is possible then it is not necessary to carry out the 

calculations for any of the channels except the one that will lead to the greatest increase in 
DCF. In general if this approach is valid, it is worth doing as the computational savings 
will be considerable. An example of this technique is now given. If the cost function is Nf 

25 with L=0 then the maximum possible gain in DCF achievable for a given delay is simply 
the correlation between the two signals after that delay has been applied. By using the 
circulant assumption, this correlation can be calculated for all possible delays by using 
Fourier transforms. This is a prior art technique for calculating the cross correlation 
between two signals, and is computationally inexpensive compared with calculating the 

30 cross correlations for each delay individually. Thus the delay which leads to the maximum 
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cross correlation between the two signals can be selected, without having to apply delays 
and calculate rotations for each of the channels. 

It is not in fact essential to choose whichever channel produces the greatest DCF: another 
channel giving a value relatively close to this value would be acceptable. Also a channel 
might be selected to give a more convenient value for one of the parameters, e.g. a 
shorter delay. If however a channel is chosen giving a value insufficiently close to the 
greatest DCF, then the process of signal separation may fail because it might be 
impossible to rectify matters in subsequent iterations. So a channel should be selected 
which gives a DCF or improvement in decorrelation to at least a predominant part of the 
maximum in this regard obtainable over the ranges of delay and rotation parameters 
employed. 

The foregoing embodiment of the invention related to imposing wide sense decorrelation 
in a two-channel case, i.e. two signal sources, two receivers and two paths from each 
source to receivers. If there are more than two channels further processing is necessary. 
15 However the procedure of the invention remains fundamentally the same: i.e. find a 
paraunitary matrix to impose strong decorrelation on signals in the channels, the matrix 
being a product of successive elementary paraunitary matrices each of which maximises a 
measure of strong decorrelation of outputs resulting from its application to input signals. 

There are several ways of extending the invention to more than two channels. One is a 
20 simple, direct extension of the previous embodiment. Others use signal pairs, attempt to 
impose pairwise independence, and sweep through all signal pairs in some order. 

Figure 5 shows the decomposition of a paraunitary matrix with degree N for the multi- 
channel case (i.e. three or more channels) as given by the Vaidyanathan prior art 
previously mentioned. It is very similar to the two-channel case of Figure 1 . The difference 
25 is that Givens rotations, which are essentially two-channel operators, have been replaced 
with more general rotations operating on all channels at once. This can be expressed 
mathematically as:- 

H„ (*) = R„ - - A(z)R l A(z)R 0 (20) 



where as before A(z) is a matrix representing successive unit delays in the lowermost 
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channel, and R, (i = 0 to N) represents the ith rotation. In the n channel case each 

rotation has n(n-1)/2 parameters, which reduces to one parameter when n = 2 for Givens 
rotations used in the two-channel case. 

The parameterisation of Equation (20) involves the same problems as in the two-channel 
case described with reference to Figure 1 . Here again the procedure used to deal with the 
problems is the decomposition of a paraunitary matrix into elementary paraunitary 
matrices: the elementary paraunitary matrices are found by considering the decomposition 
and constraining all but the last rotation. The last rotation is unconstrained, but all others 
are constrained to either leave inputs unchanged or to reorder channels so a different 
channel receives a subsequent delay. Thus all rotations apart from the last are 
constrained to be permutations. The elementary paraunitary matrix can thus be expressed 
as:- 

V di e(O = R B A(z)-.P l A(z)P 0 =R e D d (z) (21) 

Here P. is the ith permutation matrix and R 0 is the rotation matrix parameterised by a 
vector of n(n-1)/2 rotation angles e. The combination of all permutation matrices and 
delay matrices is denoted by D d , which is referred to as the delay matrix. Its index, d, is a 
length n vector whose n elements are delays applied in respective channels. 

Equation (21) shows the extended form of an elementary paraunitary matrix. First a delay 
matrix is applied, which consists of differing delays being applied in all the channels. 
Secondly a rotation is applied. As before this is a good choice for an elementary building 
block, as it contains only one operation that mixes the signals. However, this form is 
parameterisable, the parameters being the vectors d and 9. This means that the 
decomposition of a paraunitary matrix into elementary paraunitary matrices can be written 
as:- 

H(«) = V dtf fif/ (z) ... V di(8i (z)V do , eo CO (22) 

The methodology for finding this expansion is largely the same as in the two-channel 
case. It only requires modification in selecting D , i.e. the delay set or set of possible 
delays for forming the first of the series of elementary paraunitary matrices. In the two- 
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channel case the elements of D were treated as delays applied to only one channel. In 
the more general case delays a,ve allowed in more than one channel. As in the two- 
channel case there is a countable infinity of possibilities, so a sensible finite set has to be 
selected to form D . 

The number of elements in D could be fixed by setting an upper limit denoted by I to the 
number of delays in it. Thus D can consist of all possible ways of allocating up to I 
delays between n channels. Applying the same delay to all channels at the same time is 
equivalent to applying no delay at all, so all elements in D should have at least one 
channel that is not delayed. The number of elements in D can grow very fast as n and I 
increase. As before this scheme for D is merely selection of a sensible option. Other 
possibilities exist but usually need more justification to ensure validity. It is possible to 
justify some choices (such as allowing delays to be applied to only one channel) by saying 
they allow a higher I for a fixed size of D . Thus they allow the algorithm to consider 
longer delays, at the cost of not allowing it to implement more complicated combinations 
of smaller delays. 

However it is defined, once D is fixed the process of the invention can proceed to impose 
wide sense decorrelation in the multi-channel case in a similar way to the two-channel 
case of the previous embodiment. There are two differences: firstly, the two-element delay 
vectors 5, in Figure 4 are replaced by delay vectors (elements of the delay set D) having 

n elements, where n is the number of channels. This allows the algorithm to apply an 
element of D to the signals. Secondly, the rotations calculated by known pointwise 
decorrelation algorithms at stages 75i to 75 s are now n channel rotations. These can be 

obtained by successively applying n(n-1)/2 two channel rotations, and so can be 
parameterised by n(n -l)/2 angle parameters: Prior art pointwise decorrelation algorithms 
previously mentioned provide current signals and a rotation matrix which generates them, 
so the process will not be described further. 

This approach to the multi-channel case may give difficulty over the size of the delay set 
D : because each element in D is a vector, for each element a whole n by n pointwise 
decorrelation algorithm has to be carried out. As the delay set upper limit I grows this 
becomes the dominant time constraint and slows processing down. So-called "sweeping" 
algorithms were developed in the prior art to attempt to mitigate similar problems in the 
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instantaneous case. 

The principle of using energy preserving transforms to zero the correlation between a set 
of signals has a long history. The first reference is "Uber ein Leichtes Verfahren Die in der * 
Theorie der Sacularstroungen Vorkommendern Gleichungen Numerisch Aufzulosen" 
5 Crelle's J. 30 p51-94, 1846 by C.G.J. Jacobi. He introduced the idea of tackling the 
multichannel case by sweeping through pairs of signals in such a way as to strictly 
decrease the total measure of correlation between different signals in the problem at each 
stage. This technique is known as using Jacobi sweeps 

It has been shown that cycling through different signal pairings and imposing pairwise 
10 decorrelation leads to good decorrelation of several signals, provided that there are 
repeated sweeps through all available signal pairs. The technique for decorrelating several 
signals by repeated sweeps through all available signal pairs imposing pairwise 
decorrelation on each pair is known as the Jacobi algorithm. At any one stage it only 
works on making two signals pairwise decorrelated. 

15 This example of the invention provides a procedure called the sweeping process for 
finding an elementary paraunitary matrix based algorithm that follows the same approach 
used in the Jacobi algorithm. The procedure has a prearranged ordering of all available 
signal pairings through which it works in a sweep. Each signal pair is processed using the 
two-channel process of the first embodiment of the invention. The remaining signals are 

20 aligned in time (delayed) so they have at least approximately the same delay applied to 
them as the signal pair being processed. At the end of a sweep the same termination 
condition from the previous embodiment of the invention can be applied to decide if 
another sweep needs to be carried out or if no more significant improvement is obtainable. 

Figure 6 illustrates this procedure for imposing wide sense decorrelation on three signals 
25 A, B and C. As mentioned earlier the process of the invention implements the Sequential 
Best Rotation or SBR algorithm. In the drawing, "SBR2" indicates application of the SBR 
algorithm for a two-channel case as in the previous embodiment. 

In an initial phase, signals A and B are processed with SBR2 at 90, and at 91 signal C is 
aligned in time for substantial synchronisation with the output from 90. Time alignment at 
30 91 is carried out by delaying C by half the order of the polynomial matrix determined at 90 
- i.e. C is delayed in time by half the sum total of the delays applied in the processing of A 
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and B irrespective of which channel the or (as the case may be) each delay lies in. 
Alignment of C in this way is not essential but has been found to give acceptable results. It 
is not necessary to allow for processing delay to implement SBR2 at 90, because the 
signals A, B and C are sequences of .digital data samples stored when not in use: 
5 "delaying" C relative to A or B merely means changing the sample number in a sequence 
input to a subsequent processing stage. 

At stages 90 and 91 in combination, at least one elementary paraunitary matrix is applied 
to all three signals: such a matrix can be generated by extending the or as the case may 
be each two channel elementary paraunitary matrix produced at 90 to cover three 
10 channels. The or each matrix applies rotation and delay to signals A and B but it applies a 
delay to signal C without rotating it. If more than one elementary paraunitary matrix is 
determined at 90, all delays applied to signal C must add up to the total delay applied to C 
at 91. 

In a second phase, signal B (after processing at 90) and signal C (after delay at 91) are 
15 processed with SBR2 at 92, and signal A (after processing at 90) is aligned in time at 93 
with the output from 92. In a third phase, signal A (after alignment at 93) and signal C 
(after processing at 92) are processed with SBR2 at 94, and signal B (after processing at 
92) is aligned in time at 95 with the output from 94. 

The next stage at 96 is to determine whether or not the signals have become sufficiently 
20 strongly decorrelated to warrant terminating the process,, or whether it is necessary to 
continue: as before this termination condition is satisfied if DCFs of the signals have not 
improved significantly: if so, processing terminates; if not a respective elementary 
paraunitary matrix determined at each of 90, 92 and 94 is used to pre-multiply a stored 
product of those preceding it and the latest resulting product replaces that previously 
25 stored; the latest current signals from SBR2 94 and alignment 95 are then fed back to 90 
and 91 via a feedback loop 97 for the procedure to be repeated. 

In this embodiment, one or more two-channel elementary paraunitary matrices may be 
applied in the SBR2 stages at 90, 92 and 94 for each circulation round the loop of items 
90 to 97. It is possible to apply as many elementary paraunitary matrices at these stages 
30 as one wishes, up to the number necessary to meet the termination condition for the 
SBR2 algorithm. It is preferred to apply only one elementary paraunitary matrix at the 
stages 90, 92 and 94 because it reduces the need for signal alignment at 91 , 93 arid 95: it 
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uses the philosophy of applying the more important rotations first, because earlier 
rotations in the SBR2 process tend to give more performance gain than later equivalents. 

If, rather than operating on signal pairs in a fixed order, the pair that leads to the greatest 
DCF increase is operated on first, another approach for imposing wide sense 
5 decorrelation can be produced. This approach also operates on pairs, but pair ordering is 
not fixed. Instead all possible pairs are considered and the one that leads to the greatest 
improvement in DCF is accepted. This is known as a searching process, as it searches 
through possible signal pairs. Figure 7 illustrates the procedure for three signals A, B and 
C. The signals are replicated n(n - 1 )/2 times where n is the number of signals: n = 3 in 

10 the present example, which coincidentally means there are three replicas of the signals. In 
an initial phase, using a first replica of the three signals A, B and C, at 100 signals A and B 
are processed with SBR2, and signal C is aligned in time with the output so produced. 
Here again a (or as the case may be each) two channel elementary paraunitary matrix 
produced by the application of SBR2 to two signal channels is extended to an n channel 

15 elementary paraunitary matrix that delays (but does not rotate) the or each remaining 
signal C to preserve time alignment with SBR2-processed signals A and B. 

For a second replica, at 101 signals A and C are processed with SBR2, and signal B is 
aligned with the output. For a third replica, at 102 signals B and C are processed with 
SBR2 at 94, and signal A is aligned with the output. At 103, the DCFs of the three output 
20 signal groups are determined: the group with the greatest DCF is selected, and if the DCF 
shows an improvement over the input signals to this stage, then the operation is accepted. 

As before, if the DCF of all the signals has been improved, processing continues with new 
current signals and new current matrix being fed back via a loop 104 to 100, 101 and 102. 
If there has been no significant improvement, processing terminates and at 105 the 
25 current signals and paraunitary matrix are output. 

Again the SBR2 steps 100, 101 and 102 may consist of applying a full SBR2 process of 
calculating a series of elementary paraunitary matrices until no improvement is obtained, ' 
or just finding one elementary paraunitary matrix. The latter case is equivalent to a 
restricted version of a full SBR: i.e. ignoring time alignment, the delays are restricted to 
30 only delaying one channel and the rotations to only applying a rotation to the delayed 
channel and one other. This reduces the size of the delay set and hence increases the 
speed of computation of elementary paraunitary matrices. 
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The equations and procedures given in the foregoing description can clearly be 
implemented by an appropriate computer program comprising program instructions 
recorded on an appropriate carrier medium and running on a conventional computer 
system. The carrier medium may be a memory, a floppy or compact or optical disc or 
other hardware recordal medium, or an electrical signal. Such a program is straightforward 
for a skilled programmer to implement from the foregoing description without requiring 
invention, because it involves well known computational procedures. 
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CLAIMS 

1 . The present invention provides a method of strong decorrelation of signals which 
are input, characterised in that it includes the steps of:- 

a) processing the input signals to determine delay and rotation parameters 
which implement at least one elementary paraunitary matrix and transform 
the input signals into output signals to obtain improvement in a measure of 
strong decorrelation; 

b) assessing the improvement in the measure of strong decorrelation, and if it 
is significant designating the output signals as input signals and iterating 
step a) and this step b), but if it is not significant moving to step c); 

c) designating the output signals as signals decorrelated in a wide sense. 

2. A method according to Claim 1 characterised in that the delay and rotation 
parameters which transform the input signals characterise a single elementary 
paraunitary matrix. 

3. A method according to Claim 2 characterised in that it includes producing a 
paraunitary matrix by cumulatively multiplying successive elementary paraunitary 
matrices produced by iterating step a). 

4. A method according to Claim 2 characterised in that the range of signal delay 
parameters is a set of discrete delay vectors, and the delay and rotation 
parameters are determined by generating a respective version of the input signals 
delayed by each delay vector in the set, and for each version finding rotation 
parameters which at least approach producing maximisation of output signals' 
strong decorrelation. 

5. A method according to Claim 4 characterised in that the rotation parameters which 
at least approach producing maximisation of output signals' strong decorrelation 
are determined using an algorithm for pointwise decorrelation of the kind used in 
instantaneous decorrelation. 

6. A method according to Claim 1 involving n input signals where n is an integer 
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greater than 2, characterised in that the range of signal delay parameters is a set 
of n-element delay vectors and the range of signal rotation parameters is a set of 
n(n - 1 )/2 angle parameters. 

7. A method according to Claim 1 involving n input signals where n is an integer 
greater than 2, characterised in that step a) comprises determining delay and 
rotation parameters which implement at least one elementary paraunitary matrix 
providing for rotation of a pair of input signals and relative delay of the or as the 
case may be each other input signal. 

8. A method according to Claim 7 wherein the n input signals are associated with 
respective channels characterised in that step a) has n(n - 1 )/2 successive stages 
each associated with at least one respective elementary paraunitary matrix and 
each providing for rotation of signals associated with a respective pair of channels 
and provision of relative delay associated with the or as the case may be each 
other channel, the first stage is arranged to process the input signals and the or as 
the case may be each subsequent stage is arranged to receive signals processed 
in the respective preceding stage. 

9. A method according to Claim 1 involving a set of n input signals where "n is an 
integer greater than 2, characterised in that it comprises: 

a) producing n(n - 1 )/2 replicas of the set of input signals, 

b) in each replica selecting a respective signal pair differing to that selected in 
other replicas, and 

c) the step of processing the input signals to determine delay and rotation 
parameters being carried out for each replica and comprising: 

i) determining delay and rotation parameters which implement at least 
one elementary paraunitary matrix providing for rotation of the 
respective selected signal pair only, and 

ii) determining which replica when transformed by the associated at 
least one elementary paraunitary matrix gives rise to transformed 
signals corresponding to improvement in a measure of decorrelation 
by at least a major part of a maximum extent obtainable over the 
replicas and designating these transformed signals as output 
signals. 
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10. Computer apparatus for strong decorrelation of signals, the apparatus being 
programmed for reception of input signals, characterised in that the apparatus is 
also programmed to:- 

a) process the input signals to determine delay and rotation parameters which 
implement at least one elementary paraunitary matrix and transform the 
input signals into output signals to obtain improvement in a measure of 
strong decorrelation; 

b) assess the improvement in the measure of strong decorrelation, and if it is 
significant to designate the output signals as input signals and iterate a) and 
b), but if it is hot significant move to step c); 

c) designate the output signals as signals decorrelated in a wide sense. 

11. Apparatus according to Claim 10 characterised in that the delay and rotation 
parameters which transform the input signals characterise a single elementary 
paraunitary matrix. 

12. Apparatus according to Claim 11 characterised in that the computer equipment is 
programmed to produce a paraunitary matrix by cumulatively multiplying 
successive elementary paraunitary matrices produced in iterative processing. 

13. Apparatus according to Claim 11 characterised in that the range of signal delay 
parameters is a set of discrete delay vectors, and the computer equipment is 
programmed to determine the delay and rotation parameters by generating a 
respective version of the input signals delayed by each delay vector in the set, and 
to find for each version rotation parameters which at least approach producing 
maximisation of output signals' strong decorrelation. 

14. Apparatus according to Claim 13 characterised in that the computer equipment is 
programmed to determine the rotation parameters which at least approach 
producing maximisation of output signals' strong decorrelation using a pointwise 
decorrelation algorithm. 

15. Apparatus according to Claim 10 wherein the computer equipment is programmed 
to receive n input signals where n is an integer greater than 2, characterised in that 
the range of signal delay parameters is a set of n-element delay vectors and the 
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range of signal rotation parameters is a set of n(n - 1)/2 angle parameters. 

16. Apparatus according to Claim 10 wherein the computer equipment is programmed 
to receive n input signals where n is an integer greater than 2, characterised in that 
the computer equipment is also programmed to determine delay and rotation 
parameters which implement at least one elementary paraunitary matrix providing 
for rotation of a pair of input signals and relative delay of the or as the case may be 
each other input signal. 

17. Apparatus according to Claim 16 wherein the computer equipment is programmed 
to define respective channels for the n input signals characterised in that the 
computer equipment is also programmed to process the input signals in n(n - 1 )/2 
successive stages each associated with at least one respective elementary 
paraunitary matrix and each providing for rotation of signals associated with a 
respective pair of channels and provision of relative delay associated with the or as 
the case may be each other channel, the first such stage involving processing the 
input signals and the or as the case may be each subsequent stage involving 
processing signals resulting from processing in the respective preceding stage. 

18. Apparatus according to Claim 10 wherein the computer equipment is programmed 
to receive a set of n input signals where n is an integer greater than 2, 
characterised in that the computer equipment is also programmed to: 

a) produce n(n ~ 1)/2 replicas of the set of input signals, 

b) in each replica select a respective signal pair differing to that selected in 
other replicas, and 

c) implement processing of the input signals to determine delay and rotation 
parameters for each replica as input signals and determine: 

i) delay and rotation parameters which implement at least one 
elementary paraunitary matrix providing for rotation of the respective 
selected signal pair only, and 

ii) which replica when transformed by the associated at least one 
elementary paraunitary matrix gives rise to transformed signals 
corresponding to improvement in a measure of strong decorrelation 
by at least a major part of a maximum extent obtainable over the 
replicas and designate these transformed signals as output signals. 
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A computer programme for implementing strong decorrelation of signals input to 
computer apparatus, characterised in that it has instructions for controlling 
computer apparatus to:- 

a) process the input signals to determine delay and rotation parameters which 
implement at least one elementary paraunitary matrix and transform the 
input signals into output signals to obtain improvement in a measure of 
strong decorrelation; 

b) assess the improvement in the measure of strong decorrelation, and if it is 
significant to designate the output signals as input signals and iterate a) and 
b), but if it is not significant move to step c); 

c) designate the output signals as signals decorrelated in a wide sense. 

A computer programme according to Claim 1 9 characterised in that the delay and 
rotation parameters which transform the input signals characterise a single 
elementary paraunitary matrix. 

A computer programme according to Claim 20 characterised- in that that it is 
arranged to control computer equipment to implement the step of producing a 
paraunitary matrix by cumulatively multiplying successive elementary paraunitary 
matrices produced by iterating processing of the input signals to determine delay 
and rotation parameters. 

A computer programme according to Claim 20 characterised in that the range of 
signal delay parameters is a set of discrete delay vectors, and the computer 
programme is arranged to provide for the delay and rotation parameters to be 
determined by generating a respective version of the input signals delayed by each 
delay vector in the set, and for each version finding rotation parameters which at 
least approach producing maximisation of output signals' strong decorrelation. 

A computer programme according to Claim 22 characterised in that it is arranged 
to provide for the rotation parameters which at least approach producing 
maximisation of output signals 1 strong decorrelation to be determined using a 
pointwise decorrelation algorithm. 
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A computer programme according to Claim 19 arranged to control computer 
equipment to receive n input signals where n is an integer greater than 2, 
characterised in that the range of signal delay parameters is a set of n-element 
delay vectors and the range of signal rotation parameters is a.set of n(n-1)/2 
angle parameters. 



25. A computer programme according to Claim 19 arranged to control computer 
equipment to receive n input signals where n is an integer greater than 2, 
characterised in that it is arranged to provide for processing the input signals to 
determine delay and rotation parameters to comprise determining such parameters 
which implement at least one elementary paraunitary matrix providing for rotation 
of a pair of input signals and relative delay of the or as the case may be each other 
input signal. 

26. A computer programme according to Claim 25 arranged to control computer 
equipment to define respective channels for the n input signals characterised in 
that it is arranged to provide for processing of the input signals to determine delay 
and rotation parameters to have n(n - 1)/2 successive stages each associated with 
at least one respective elementary paraunitary matrix and each providing for 
rotation of signals associated with a respective pair of channels and provision of 
relative delay associated with the or as the case may be each other channel, the 
first stage being arranged to process the input signals and the or as the case may 
be each subsequent stage being arranged to receive signals processed in the 
respective preceding stage. 



27. 



A computer programme according to Claim 21 arranged to control computer 
equipment to receive a set of n input signals where n is an integer greater than 2, 
characterised in that it also provides for such equipment to: 

a) produce n(n - 1 )/2 replicas of the set of input signals, 

b) in each replica select a respective signal pair differing to that selected in 
other replicas, and 

c) carry out processing of determine delay and rotation parameters for each 
replica as input signals by: 

i) determining delay and rotation parameters which implement at least 
one elementary paraunitary matrix providing for rotation of the 
respective selected signal pair only, and 
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ii) determining which replica when transformed by the associated at 
least one elementary paraunitary matrix gives rise to transformed 
signals corresponding to improvement in a measure of strong 
decorrelation by at least a major part of a maximum extent 
obtainable over the replicas and designating these transformed 
signals as output signals. 



ABSTRACT 



A method of strongly decorrelating signals comprises processing input signals to 
determine delay and rotation parameters which implement at least one elementary 
paraunitary matrix. The parameters transform the input signals into output signals with 
improvement in a measure of strong decorrelation. The improvement in the measure of 
strong decorrelation is then assessed: if it is significant the latest output signals are 
designated as input signals and the process is iterated. Iteration continues until the 
measure of strong decorrelation ceases to improve significantly, at which point the latest 
output signals are designated as signals decorrelated in a wide sense. 
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